The task of estimating the maximum number of concurrent speakers from single-channel mixtures is important for various audio-based applications, such as blind source separation, speaker diarisation, audio surveillance, or auditory scene classification. Building upon powerful machine learning methodology, we develop a Deep Neural Network (DNN) that estimates a speaker count. While DNNs efficiently map input representations to output targets, it remains unclear how best to handle the network output to infer integer source count estimates, as a discrete count estimate can be tackled either as a regression or as a classification problem. In this paper, we investigate this important design decision and also address complementary parameter choices such as the input representation. We evaluate a state-of-the-art DNN audio model based on a Bi-directional Long Short-Term Memory (BLSTM) network architecture for speaker count estimation. Through experimental evaluations we identify the best overall strategy for the task and show results for five-second speech segments in mixtures of up to ten speakers.
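The regression-versus-classification design decision mentioned above can be illustrated by how each output head is decoded into an integer count. The sketch below is a hypothetical illustration (function names and the rounding/clipping scheme are assumptions, not the paper's implementation): a regression head emits one real value that must be rounded and clipped, while a classification head emits a distribution over the possible counts and is decoded with an argmax.

```python
import numpy as np

# Illustrative sketch of decoding a network output into an integer
# speaker count; names and details are assumptions for this example.
MAX_SPEAKERS = 10

def count_from_regression(output_scalar: float) -> int:
    # Regression: the network emits a single real value; round it and
    # clip to the valid integer range [0, MAX_SPEAKERS].
    return int(np.clip(np.round(output_scalar), 0, MAX_SPEAKERS))

def count_from_classification(output_probs: np.ndarray) -> int:
    # Classification: the network emits a distribution over the
    # MAX_SPEAKERS + 1 possible count classes; take the argmax.
    return int(np.argmax(output_probs))

# Usage: a regression head predicting 3.4 speakers decodes to 3 ...
print(count_from_regression(3.4))

# ... as does a classification head whose distribution peaks at class 3.
probs = np.zeros(MAX_SPEAKERS + 1)
probs[3] = 0.9
probs[4] = 0.1
print(count_from_classification(probs))
```

A practical consequence of this choice is the loss function: the regression head is typically trained with a mean-squared-error objective, whereas the classification head uses cross-entropy over the count classes.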